Prediction of Genomic Methylation Status on CpG Islands Using DNA Sequence Features

Authors

  • YOICHI YAMADA
  • KENJI SATOU
Abstract

In mammals, the cytosines of most CpG dinucleotides in the genome, except those in gene promoters, are subject to modification by a methyl group (methylation). Many mammalian genes are regulated by methylation in a developmental stage-specific or tissue-specific manner. Mammalian DNA methylation contributes to the regulation of gene expression, repression of parasitic sequences, inactivation of the X chromosome in females, genomic imprinting, and other processes. Aberrant methylation results in some cancers and genetic diseases in humans. It is therefore necessary to determine the methylation status of the human genome comprehensively in each cell type. However, because comprehensive methylation analyses require considerable time and labor, the methylation status of only a fraction of genomic regions has been determined in mammals. For this reason, it is important to train machine learning models on known methylation data and to predict the methylation status of other genomic regions. Moreover, the sequence differences between DNA regions with different methylation status remain unclear and should also be determined. We therefore trained a support vector machine on our previously reported methylation data and predicted the methylation status of DNA sequences from DNA sequence features. Furthermore, we used random forest to explore the sequence features that differ among four types of methylation. High prediction accuracies were obtained for two pairs of different methylation status. Moreover, sequences containing CG, CT or CA were found to be important for discriminating between them.

Key-Words: CpG island, DNA methylation, Human chromosome 11, Human chromosome 21, Support vector machine, Random forest
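
The abstract does not specify which DNA sequence features were used. As a minimal sketch, assuming the features are overlapping k-mer frequencies plus the commonly used observed/expected CpG ratio, a sequence could be turned into a numeric vector as follows. The function names `kmer_frequencies`, `cpg_obs_exp` and `feature_vector` are hypothetical and are not taken from the paper.

```python
from itertools import product


def kmer_frequencies(seq, k=2):
    """Return the relative frequency of every overlapping k-mer over A/C/G/T."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip windows containing N or other symbols
            counts[kmer] += 1
            total += 1
    if total == 0:
        return {kmer: 0.0 for kmer in kmers}
    return {kmer: counts[kmer] / total for kmer in kmers}


def cpg_obs_exp(seq):
    """Observed/expected CpG ratio: (#CG * length) / (#C * #G)."""
    seq = seq.upper()
    c_count = seq.count("C")
    g_count = seq.count("G")
    if c_count == 0 or g_count == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c_count * g_count)


def feature_vector(seq, k=2):
    """Assumed feature layout: k-mer frequencies in sorted order, then CpG o/e."""
    freqs = kmer_frequencies(seq, k)
    return [freqs[kmer] for kmer in sorted(freqs)] + [cpg_obs_exp(seq)]
```

Vectors of this kind could then be passed to any support vector machine or random forest implementation. With k = 2, the CG, CT and CA entries correspond to the dinucleotides the abstract reports as important for discrimination.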

Similar resources

Predicting CpG Islands and DNA Methylation in the Cow Genome Using DNA Microarray Meta-Analysis and Genome Wide Scanning

DNA methylation is a type of epigenetic change that directly affects DNA. In mammals, DNA methylation is essential for fetal development and stem cell differentiation, and it occurs mainly within CpG islands. In this study, two methods were used to study the DNA methylation profile of the cow genome. In the first method, the DNA methylation profile of the differentially expres...

Prediction of Methylation Status on DNA sequences and Identification of Its Important DNA Sequence Features

In mammals, the cytosines of most CpG dinucleotides in the genome, except those in gene promoters, are subject to modification by a methyl group (methylation). Many mammalian genes are regulated by methylation in a developmental stage-specific or tissue-specific manner. Mammalian DNA methylation contributes to the regulation of gene expression, repression of parasitic sequences, inactivation of the X chromosome...

Study of promoter CpG island hypermethylation of cyclin-dependent kinase inhibitor gene p21waf1/cip1 on some breast carcinoma cell lines

p21 belongs to the CIP/KIP family of CDK inhibitors, which are involved in cell cycle arrest at specific stages of cell cycle progression. DNA methylation is the best-studied epigenetic mark and has been associated with chromatin condensation and repression of gene transcription. CpG island hypermethylation in the promoter region of certain genes occurs in cancer cells and affects tumor...

Histone methylation marks play important roles in predicting the methylation status of CpG islands.

The methylation status of CpG islands is highly correlated with gene expression. Current methods for computational prediction of DNA methylation use only DNA sequence features. In this study, in addition to 35 DNA sequence features, we added four histone methylation marks to predict the methylation status of CpG islands, and improved the accuracy to 89.94%. We also applied our model to predict the...

Improved Prediction of Non-methylated Islands in Vertebrates Highlights Different Characteristic Sequence Patterns

Non-methylated islands (NMIs) of DNA are genomic regions that are important for gene regulation and development. A recent study of genome-wide non-methylation data in vertebrates by Long et al. (eLife 2013;2:e00348) has shown that many experimentally identified non-methylated regions do not overlap with classically defined CpG islands, which are computationally predicted using simple DNA sequenc...


Publication date: 2008